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(54) TiUe: ENCODING MULTIWORD INFORMATION BY WORDWISE INTERLEAVING 



(57) Abstract 

Multiword information is based on multibit symbols 
disposed in relative contiguity with respect to a medium, 
and is encoded with a wordwise interleaving and word- 
wise error protection code for providing error locative clues 
across multiword groups. In particular, the clues originate 
in high protectivity clue words (BIS) that are interieaved 
among clue columns, and also in synchronizing columns 
constituted from synchronizing bit groups. The synchro- 
nizing columns are located where the clue columns are 
relatively scarcer disposed. The clues are directed to low 
protectivity target words (LDS) that are interleaved in a 
substantially uniform maimer among target columns which 
form uniform-sized column groups between periodic ar- 
rangements of clue columns and synchronizing columns. 
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Encoding multiword infonnation by wordwise interleaving. 



BACKGROUND OF THE INVENTION 

The invention relates to a method according to the preamble of Claim 1. US 
Patents 4,559,625 to Berlekamp et al, and US 5,299^08 to Blaum et al disclose decoding of 
interleaved and error protected information words, wherein an error pattern found in a first 
5 word may give a clue to locate errors in another word of the same group of words. The 

references use a fault model with multisymbol error bursts across plural words. An error in a 
particular word yields a high probability for an error to occur at a corresponding symbol 
position pointed at in a next word or words. The procedure may raise the number of corrected 
errors. A clue will however only materialize when the clue word has been fully corrected. 
10 Furthermore, the medium stores information as well as synchronization bit groups which 
represent an appreciable amount of redundancy that may also be used to signal errors in the 
target words, so that to some extent clue words may be exchanged for synchronization bit 
groups. Part of the errors is due to so-called bit shps that are more frequent further away from 
the sync bit groups. Therefore, interleaving the clue words amongst clue columns and also 
1 5 interleaving the target words amongst target colunms may further improve the error protection. 
SUMMARY TO THE INVENTION 

In consequence, amongst other things, it is an object of the present invention to 
provide a coding format allowing clue words to cooperate with synchronizing bit groups in a 
systematic format, while also smoothing out possibly negative effects of bit slips more evenly 
20 among the various words. Now therefore, according to one of its aspects the invention is 
characterized as recited in the characterizing part of Claim 1. A clue can point to an erasure 
symbol. Pointing can make enror correction more powerful. In fact, many codes will correct at 
most t errors when no error locative indication is known. Given the erasure locations, 
generally a larger number e>t of erasures may be corrected. Also protection against a 
25 combination of bursts and random errors may improve. The invention may be used both for 
storage and for transmission. 

Advantageously, the method has the features of Claim 2. This is a relatively 
simple arrangement. Alternatively, the number of synchronizing columns may be larger than 
one, and the number of clue colunms may be even if preferred. 
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Advantageously, the method has the features of Claim 3, In video recording, 
user data will relate to the image and accompanying sound to be presented to a user, whereas 
system data may indicate the program name, time, addresses, and various other parameters 
that may be useful without reference to the video or audio proper. This feature allows fast 

5 access to the system data without necessity for decoding the target words. 

Advantageously, the method has the features of Claim 4. If error-ftee, this 
feature will immediately indicate to a user device whether the processed frame of information 
is correct. 

Advantageously, the method has the features of Claim 5. This is a 
10 straightforward organization. Advantageously, the method has the features of Claim 6, This 
has proven to raise the worst case to the average level attainable for this code format. 
Advantageously, the method has the features of Claim 7. Optical storage has been proven a 
fortuitous medium. 

The invention also relates to a method for decoding information so encoded, to 
15 an encoding and/or decoding device for use with the method, and to a carrier containing 
information so encoded. Further advantageous aspects of the invention are recited in 
dependent Claims. 

BRIEF DESCRIPTION OF THE DRAWING 

These and further aspects and advantages of the invention will be discussed 
20 more in detail hereinafter with reference to the disclosure of preferred embodiments, and in 
particular with reference to the appended Figures that show: 
Figures la, lb, a record carrier; 
Figure 2, a playback device; 
Figure 3, a recording device; 
25 Figure 4, a system with encoder, carrier, and decoder; 

Figure 5, a code format principle; 

Figure 6, a schematic representation of a physical cluster on the 

carrier; 

Figure 7, a data frame; 

30 Figure 8, a composition of a data sector from two data frames; 

Figure 9, renumbering data bytes and forming an ECC sector by 

adding parities; 

Figure 10, a composition of an ECC cluster by multiplexing 16 ECC 

sectors; 
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Figure 1 1, a renumbered ECC cluster before interleaving; 
Figure 12, an interleaved ECC cluster; 

Figure 13. multiplexing of the BIS cluster with the interleaved ECC 

cluster; 

Figure 14, a BIS block containing 24 BIS code words; 
Figure 14A, the mapping of the BIS block into the BIS cluster; 
Figure 15, the frame structure for explaining main data interleaving; 
Figure 16, the interleaving proper; 

Figure 17, an example of the partial mapping of BIS bytes on the first 

eight sectors; 

Figure 18, an example of the partial mapping of BIS bytes on the last 

eight sectors; 

Figure 19, a schematic representation of the overall encoding process. 
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Fig. la shows a disc-shaped record carrier 1 1 with track 19 and central hole 10. 
Track 19 is arranged in a spiral pattern of turns forming substantially parallel tracks on an 
information layer. The carrier may be an optical disc with a recordable or prerecorded 
information layer. CD-R, CD-RW, and DVD-RAM are recordable, whereas audio-CD is an 
example of a prerecorded disc. The prerecorded type can be manufactured in a known way by 
first recording a master disc and then pressing consumer discs. On a recordable carrier the 
track is indicated by a pre-embossed track structure provided during manufacture of the blank 
carrier. The track may be structured as a pregroove 14 to enable a read/write head to follow 
the tt-ack 19 during scanning. Information is represented on the information layer by optically 
detectable marks, e.g. pits and lands. 

Fig. lb is a cross-section along line b-b of the record carrier 1 1 of the 
recordable type, transparent substrate 15 being provided with recording layer 16 and protective 
layer 17. Pregroove 14 may be implemented as an indentation or an elevation, or as a material 
property deviating from its surroundings. 

The record carrier may carry user information, that for user convenience has 
been sub-divided into smaller items, that each may have a duration of a few minutes e.g. songs 
of an album or movements of a symphony. Access information for identifying the items can 
also be provided on the carrier, e.g. a so^alled Table Of Contents (TOC) or a file system like 
ISO 9660 for CD-ROM, The access information may include playing time and start address 
for each item, and further information Uke a song titie. Such information may Represent system 
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information. The information is recorded in a digital manner after analog to digital (A/D) 
conversion. 

Figure 2 shows a playback apparatus according to the invention for reading a 
record carrier 11 such as shown in Fig. 1. The device has a drive 21 for rotating tiie record 
carrier 1, and read head 22 for scanning the track on tiie record carrier. The apparatus has 
positioning means 25 for coarse radial positioning of the read head 22. The read head 
comprises an optical system of a known type for generating a radiation beam 24 guided 
through optical elements focussed to a radiation spot 23 on a track of the information layer. 
The radiation beam 24 is generated by a known radiation source. The read head further has a 
focusing actuator for moving tfie focus of the radiation beam 24 along the optical axis of tiie 
beam and a tracking actuator for fine positioning of the spot 23 in a radial direction on the 
track centre. The tracking actuator may comprise coils for radially moving an optical element 
or may be arranged for changing tiie angle of a reflecting element. The radiation reflected by 
Uie information layer is detected by a detector of a usual type, e.g. a four-quadrant diode, in 
the read head 22 for generating a read signal and further detector signals including a tracking 
error and a focusing error signal coupled to said tracking and focusing actuators. The read 
signal is processed by reading means 27 to retrieve the data, which reading means are of a 
usual type for example comprising a channel decoder and an error corrector. The retrieved 
data is sent to data selection facility 28 to select certain information from the data read and 
sends it to buffer 29. The selection is based on data type indicators recorded on the record 
carrier, e.g. headers in a framed format. The compressed information is sent from buffer 29 to 
de-compressor 31 via signal 30. This signal may also be externally available. The de- 
compressor 31 decodes die data to reproduce the original information on output 32. The de- 
compressor may be fitted separately, as indicated by the rectangle 33 in Figure 2. 
Alternatively, the buffer may be positioned before the data selection. Control unit 20 
furthermore receives control commands from a user or from a host computer via control hnes 
26 such as a system bus, connecting drive 21, positioning means 25, reading means 27 and 
data selection means 28, and possibly also buffer 29 for buffer filling level control. Thereto, 
control unit 20 comprises control circuitry, such as a microprocessor, a program memory and 
control gates, or a state machine. 

Compression and de-compression are well-known. During de-compression an 
inverse process is applied to reconstruct tiie original signal. If tiie original digitized signal is 
reconstructed exactly, the (de-)compression is lossless, but in lossy (de)-compression some 
details of the original signal will not be reproduced. Such omitted details are substantially 
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undetectable by the human ear or eye. Most known systems, such as MPEG, use lossy 
compression for audio and video; lossless compression is used from storing computer data. 

The data selection 28 is ananged for retrieving control information from the 
data read, and for discarding any stuffing data that had been added during recording. The 
rotation rate may be adjusted using die average filling level of the buffer 29, e.g. decreasing 
the rotation rate when the buffer is more then 50% full on average. 

Fig. 3 shows a recording device for writing information on a (re)writable record 
carrier 11. During a writing operation, marks representing the information are formed on the 
record carrier. The marks may be in any optically readable form, e.g. in the form of areas 
whose reflection coefficient differs from their surroundings, through recording in materials 
such as dye, alloy or phase change, or in the form of areas with a direction of magnetization 
different from their surroundings, when recording in magneto-optical material. Writing and 
reading of information for recording on optical disks and usable fomiatting, error correcting 
and channel coding rules are well-known in the art, e.g. from the CD system. The marks can 
be formed through a spot 23 generated on the recording layer via a electromagnetic radiation 
beam 24 from a laser diode. The recording device comprises similar basic elements as the 
apparatus for reading described above in Figure 2, i.e. control unit 20, drive means 21 and 
positioning means 25, but it has a write head 39. The information is presented on the input of 
compression means 35, that may be placed in a separate housing. The variable bitrate 
compressed information on the output of the compression means 35 is passed to buffer 36. 
From the buffer 36 the data is passed to data combination means 37 for adding stuffing data 
and further control data. The total data stream to be recorded is passed to writing means 38. 
The write head 39 is coupled to the writing means 38, which comprise a formatter, an error 
encoder and a channel modulator. The data presented to the input of the writing means 38 is 
distributed over logical and physical sectors according to formatting and encoding rules 
discussed infra, and converted into a write signal for write head 39. Unit 20 is arranged to 
control the buffer 36, the data combination means 37 and the writing means 38 via control 
lines 26 and for performing the positioning procedure as described above for the reading 
apparatus. The recording apparatus may be arranged for reading through the features of a 
playback apparatus and a combined write/read head. 

Figure 4 shows a comprehensive system according to the invention, 
provided with an encoder, a carrier, and a decoder. The embodiment is used for encoding, 
storing, and finally decoding a sequence of samples or multibit symbols derived from an audio 
or video signal, or from data. Terminal 120 receives a stream that may have eight bit symbols. 
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Splitter 122 rccunently and cyclically transfers first symbols intended for the clue words to 
encoder 124. and all-other symbols to encoder 126. In encoder 124 the clue words are formed 
by encoding the data into code words of a first multi-symbol error correcting code. This code 
may be a Reed-Solomon code, a product code, an interleaved code, or a combination. In 
encoder 126 the target words are formed by encoding into code words of a second multi- 
symbol error correcting code. In Figure 5, all code words have a uniform length, but this is not 
a restriction. The clue words may have a much higher degree of error protection. This may be 
effected by a larger number of check symbols, by a smaller number of data symbols, or by a 
combination thoeof. 

In block 128, the code words are transferred to one or more outputs of which an 
arbitrary number has been indicated, so that the distribution on a medium to be discussed later 
will become uniform. Block 130 symboUzes the unitary medium such as tape or disc that 
receives the encoded data. This may relate to direct writing in a write-mechanism-plus- 
medium combination. Alternatively, the medium may be a copy ft-om a master encoded 
medium such as a stamp. Storage may be optical and fully serial, but other configurations may 
be used. In block 132. the various words are read again from the medium. Then the clue words 
of the first code will be sent to decoder 134, and decoded as based on their inherent 
redundancies. Furthermore, as will become apparent in the discussion of Figure 5 hereinafter, 
such decoding may present clues on the locations of errors in other than these clue words. 
Moreover, the information ftx)m synchronizing bit columns may be analyzed as to 
interferences therein, to immediately produce additional clues for the target words. Box 135 
receives all clues and contains a program for using one or more different strategies to translate 
such clues to erasure locations. The target words are decoded in decoder 136. With help from 
the erasure locations, the error protection of the target words is raised to a higher level. 
Finally, all decoded words are demultiplexed by means of element 138 to output 140 
conforming to the original format. For brevity, electro-mechanical interfaces between the 
various subsystems were ignored. 

Figure 5 illustrates a simple code format. The coded information has been 
shown arranged in a 480 symbol block with 15 horizontal rows and 32 vertical rows. Storage 
on a medium starts at the top left and proceeds along vertical rows. The hatched region 
contains check symbols: horizontal rows 4, 8, and 12 have 8 check symbols each and 
constitute clue words. The other rows contain 4 check symbols each and constitute target 
words. The whole block has 408 information symbols and 72 check symbols. The latter may 
be localized in a more distributed manner over the respective words. Further to the above, the 
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top horizontal row contains a representation of synchronizing bit groups. These are present on 
the medium to synchronize a reader device to the format, but generally contain neither system 
data nor user data and have a prescribed format with much redundancy. Tlierefore, it is often 
easy to detect an interference, and the occurrence of a single one or a plurality of disturbed 
5 synchronizing bit groups that are physically close to each other or to distributed clue symbols 
can be used to signal occurrence of a burst error. This will produce clues in similar manner as 
the clue words. 

The Reed-Solomon code allows to correct in each clue word up to four symbol 
errors. Actual symbol errors have been indicated by a crosses. In consequence, all clue words 

10 may be decoded correctly, inasmuch as they donot have more than four errors. Notably words 
2 and 3 may however not be decoded on the basis of their own redundancy alone. In the 
Figure, all errors, except 62, 66, 68 represent error strings. Only strings 52 and 58 cross at 
least three consecutive clue words and are considered as error bursts, causing at least all 
intermediate symbol locations to get an erasure flag. Also, target words just before the first 

15 clue word error of the burst and target words just after the last clue word error of the burst may 
get an erasure flag at that location, depending on the strategy followed. String 54 is not 
considered a burst, because it is too short. 

As a consequence, two of the errors in word 4 produce an erasure flag in the 
associated vertical rows. This renders words 2 and 3 correctable, each with one error symbol 

20 and two erasure symbols. However, random errors 62, 68. nor string 54 constitute clues for 
words 5, 6, 7, because each of them contains only one clue word. Sometimes, an erasure cause 
a zero error pattern, because an arbitrary error in an 8-bit symbol has a 1/256 probability to 
cause again a correct symbol. Likewise, a long burst crossing a particular clue word may 
produce a correct symbol therein. By a bridging sti-ategy between preceding and succeeding 

25 clue symbols of the same burst, this correct symbol is then incorporated into the burst, and in 
the same manner as erroneous clue symbols translated into erasure values for appropriate 
target symbols. The above decisions may vary according to decoding policy. The clues by the 
sync bit groups may be used in similar manner as those from the clue words. 

The relevance of the present invention is brought about by newer methods for 

30 digital optical storage. A particular feature is that in the case of substrate incident reading the 
upper transmissi ve layer is as thin as 100 micron. The channel bits have a size of some 0. 14 
micron, so that a data byte at channel rate of 2/3 will have a length of only 1.7 microns. At the 
top surface the beam has a diameter of some 125 microns. A caddy or envelope for the disc 
will reduce the probability of large bursts. Non-conforming particles of less than 50 microns 



wo 00/07300 PCT/EP99/05338 

8 

may cause short faults, although the invention is also useful against longer faults. A fault 
model has been used wherein 50 micron faults through error propagation may lead to bursts of 
200 microns, corresponding to some 120 Bytes. A particular model has fixed size bursts of 
120B that start randomly with a probability per byte of 2.6*10 "^ or on the average one burst 
per 32kB block. Hie invention has been conceived for optical disc storage, but other 
configurations such as multitrack tape, and other technologies such as magnetic and magneto- 
optical would also benefit from the improved ^proach. 
Discussion of a preferred format of ttie information 

Before recording. User Data received ftom a source, that may be an application 
or a host, is formatted in a number of successive steps, that will be explained more in detail 
with reference to Figure 19, as follows: Data Frames, Data Sectors, ECC Sectors, an ECC 
Cluster, a BIS Cluster, a Physical Cluster and Recording Frames. 

The data is recorded in 64k partitions, called Physical Clusters, that each 
contain 32 Data Frames with 2048 bytes of User Data. A Physical Cluster is protected by two 
error correction mechanisms: 

- firstly, a Long Distance (LDS) (248,216,33) Reed-Solomon (RS) error correction code; 

- secondly, the data is multiplexed with a Burst Indicator Subcode (BIS), that consists of 
(62,30,33) Reed-Solomon (RS) codewords. The number of parity symbols is equal for the two 
code categories, which allows to use the same decoder hardware for both cases. The BIS code 
is used to indicate long burst errors, by which the LDS code can more efficiently perform 
erasure corrections. All data is arranged in an array as shown in Figure 6. Note that the 
horizontal and vertical directions have been interchanged with resepect to Figure 5. The array 
is read along a horizontal direction, and is recorded on the disc after inserting synchronization 
patterns and additional d.c. control bits, and after modulation. 

The error correction codes are applied in vertical direction, which gives a good 
basic break-up of burst errors on the disc. Additionally, the LDS codewords have been 
interleaved in a diagonal direction. For the purpose of addressing, a whole Physical Cluster is 
subdivided into 16 Physical Sectors, each consisting of 32 consecutive rows. 

A Data Frame consists of 2052 bytes: 2048 User Data bytes numbered do to 
d2047 and 4 Error Detection Code (EDC) bytes numbered 62048 to eaosi- The bytes e2048 to 02051 
contain an Error Detection Code computed over 2048 bytes of the Data Frame. The Data 
Frame is considered as a field of single bits, starting with the most significant bit of the first 
User Data byte do and ending with the least significant bit of the last EDC byte 62051. The msb 
is bi64i5 and the Isb is bo. Each bit bi of the EDC is shown as follows for i = 0 to 3 1 : 
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EDCix)^J^h,x' l(x)modG(x) 



iB31 



where: l(x) = f^b.x' , G(x) = + x^' + + 1 (D 

ialMlS 



Next, two Data Ftames (A, B) are placed in an array of 19 columns by 216 rows, called a Data 
Sector. The filling of the array is done column by column, starting at the top of the first one 
with byte do^ and ending at the bottom of the last column with byte e2Mi3' cf. Figure 8. 

Next in Figure 9, the bytes in each column of the Data Sector are renumbered 
starting from the top of the column as follows: d^c, dtj .. dy .. to di^is, L being the column 
number (0 .. 18). The ECC Sector is completed by extending each colunrn with 32 Parity bytes 
of a (248,216.33) long distance RS code. The Parity bytes are: Pi^it, Vhan, ... Plj to Pi^t- 

The long distance RS code is defined over the finite field GF(2*). Non-zero 
elements of the finite field GF(2*) are generated by a primitive element a that is a root of the 
primitive polynomial p(X) = X" + X* + + X* + 1. The symbols of GF(2'') are represented 
by 8-bit bytes, using a polynomial base representation with (a', a*, a?, ... a*, a, 1) as a basis. 
The root a is represented as a = 00000010. Each LDS codeword, represented by the vector Ids 
= (duo " dL4 .. dL,2i5 Pu2i6 - Plj -- Pl,247), belongs to a Reed-Solomon code over GF(2\ 
with 216 information bytes and 32 parity bytes. Such codeword can be represented by a 
polynomial Ws(x) of degree 247, that may have some coefficients zero, the highest degrees 
corresponding to the information part of the vector (di^ ■• etc.) and the lowest degrees 
corresponding to the parity part (Pi^w .. etc.). Now, Ids(x) is a multiple of the generator 
polynomial g(x) of the LDS codeword. The generator polynomial is: 



«(Jc) = n(x-a') 

The LDS code is systematic: the 216 information bytes appear unaltered in the 
highest order positions of each codeword. The parity check matrix of code Ids is such that: 
Hlds * Ids*^ = 0 for all LDS codewords Ids. 

The second row Hlds 2 of the parity check matrix Hlds is given by 
Hlds 2 = (o^^ cc^^ a 1) and corresponds to the zero a of the generator polynomial g(x) 
which defines the codeword positions to be used for error locations. 

After generating the LDS codewords in the ECC Sectors, 16 consecutive ECC 
Sectors are combined to one ECC Cluster by multiplexing the 16*19 columns of height 248, 2 
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by 2, including the parities. In this way 152 new columns are fomed with a height of 496 
bytes as shown in Figure 10. The numbering of the bytes is according to dL>i.N where: 
L = 0..18 is the LDS code word number within the ECC Sector 

M = 0.247 is the byte number within the LDS code word 

5 N = ©..IS is the ECC Sector number. 

For further improving the burst error correcting capabilities, extra interieaving 
is introduced by renumbering the bytes are in horizontal direction through all rows of the ECC 
Cluster, cf. Figure 11. Now all rows of an ECC Cluster are shifted two by two over 
mod(k*3,152) bytes to the left, starting from row 2 onwards, the first row being row 0; k = 
10 div(row-number, 2). The bytes that shift out at the left side re-enter the array from the right 
side, cf. Figure 12. After this process the bytes are renumbered once more in horizontal 
direction through all rows, resulting in the numbering Do to D75391 indicated in Figure 6. The 
renumbering of the bytes causes non-uniform mapping of logical addresses on physical 
addresses. Consequences thereof will be discussed infra. 
15 After interieaving, the interieaved ECC Cluster is split into 4 groups of 38 

columns each. In between the 4 groups, 3 columns of one byte width each are inserted. These 
columns carry the address information relating to the data contained in the interieaved ECC 
Cluster. They consist of (62,30,33) RS BIS codewords with 30 bytes of information and 32 
bytes of parity. Due to the high error protection capabilities and an advanced interieaving 
20 scheme, these columns may also offer a reliable indication of burst errors. 

A 3 by 496 byte array formed by the 3 BIS columns from the Physical Cluster, 
is called a BIS Cluster. The content of the BIS Cluster is formed by placing all BIS codewords 
of a BIS Cluster in the 24 columns of a 24 by 62 byte array, cf . Figure 14. 

The BIS RS code is defined over the finite field GF(2*). The non-zero elements 
25 of the finite field GF(2*) are generated by a primitive element a, that is a root of the primitive 
polynomial p(x) = X** + +X^ + X^ + 1. The symbols of GF(2*) are represented by S-bit 
bytes, using the polynomial base representation, with (a^, a*, ... o^, ct, 1) as a basis. The root 
a is represented as a = 00000010, 

Each BIS codeword is represented by a vector bis = (bc,o bcj bc^9j Pbc^ 
30 Pbcj Pbc6i) in a Reed-Solomon code over Gr(2^), with 32 parity bytes and 30 information 
bytes. Such codeword may be represented by a polynomial bis(x) of degree 61, that may have 
some coefficients zero, the highest degrees representing the information part of the vector (bc.o 
.. etc.) and the lowest degrees the parity part of the vector (Pbc^o - etc.). Each codeword is a 
multiple of the generator polynomial g(x) of the BIS codeword: 
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g(x)=n(x.a') 

The BIS code is systematic: the 30 infonnation bytes appear unaltered in the 
highest Older positions of each codeword. The parity check matrix of code bis is such that 
Hbk * bls'^= 0 for all BIS codewords bis. The second row hBis2 of the parity check matrix 
Hbis is given by hsB 2 = iof\ a!" a, 1) It corresponds to the zero a of the generator 
polynomial g(x), and defines the codeword positions to be used for error locations. 

Figure 15 shows the ftame structure for explaining main data interleaving. Hie 
various physical sectors and columns carry appropriate numbering. There are 
304*[248,2 16,33] LDS code words, and 24*[62,30,33] BIS code words. 

A 2kB logical sector contains 9.5 LDS code words in which 2048 User Data 
bytes and 4 EDC bytes have been encoded, and further 22.5 BIS Bytes for storing a header, 4 
ID + 2EDC bytes, copyright data, and 6 Copyright Management Information CPR.MAI. 
Furthermore, 10.5 BIS Bytes have been reserved for possible future use. A 4 kB Physical 
Sector consists of 31 rows, where the header bytes of two 2kB logical sectors are stored in 
physically prescribed positions. Figure 16 shows the interieaving proper. Here, the i-th UDS 
code woKl with 0 ^ i ^ 303, contains 248 bytes d J. where j is of the form: j = [(i mod 2) + 
2*a]*lS2 + {[(I div 2) - 3*a ] mod 152} for certain values 0 ^ a S 247. Various locations are 
feasible for putting the parities, such as at the bottom, or permutated according to a + 31 *i, a 

Next, the generating of a BIS cluster will be discussed. After 
generating the BIS codewords, the BIS Block is mapped in an interleaved manner on an array 
of 496 = (16 X 31) rows * 3 columns. This new array is called a BIS Cluster shown in Figure 
14A. The positioning of the bytes from a BIS Block (Figure 14) in a BIS Cluster will first be 
given by mathematical expressions. For this purpose, the BIS Cluster is subdivided according 
to the Physical Sectors shown in Figure 6. The sectors are numbered s = 1 ...15, the rows in 
such sector are numbered r = 0... 30, and the columns are numbered e = 0 .. 2, cf. Figure 14a. 
Now byte bN,c gets the following position: 

sector number 8= mod {[div(N^) + 8 - div(C^)] , 8 } + 8 » mod (N^) 

row number r = dlv(N^) 

column number e = mod {[C + div(N,2)] , 3} 

The byte number m gives the sequence number Bm as the Physical 
cluster is written to the disc as shown in Figure 6 according to m = (s * 31 + r) * 3 + e. The 
essentials of the interieaving scheme are exemplified in Figures 17, 18, and pertain to: 
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each row of the BIS Block is spUt into 8 groups of 3 bytes each, which three-byte groups arc 
each placed in a respective row of the BIS Cluster. 

the even rows of the BIS Block are mapped on Sectors 0 through 7, the odd rows of the BIS 
Block are mapped on Sectors 8 through 15. 

the eight three-byte groups from an even row of the BIS Block arc each put on the same row 
of eight successive sectors, using the Sectors in a direction that is reverse to their numbering. 
It has been found that this reversal is effective for better dispersing of burst errors. The starting 
sector for each row of the BIS Block is one sector higher than for the previous row. 

- row N = 0 of flie BIS block is placed on rows r = 0 of the sectors 0, 7, 6, 5, ... 2, 1. 

- row N = 2 of the BIS block is placed on rows r = 1 of the sectors 1, 0, 7, 6, ... 3, 2. 

- row N = 4 of the BIS block is placed on rows r = 2 of the sectors 2, 1 , 0, 7, ... 4, 3. 

- this procedure is cyclically repeated until row N = 60, that is placed on the rows r = 30 of the 
sectors 6, 5, 4, ... 0, 7. 

Now, within each sector, each row is shifted cyclically to the right by raod(r, 3) positions: so, 
row 

r = 0 is not shifted at all. row r = 1 is shifted 1, row r = 2 is shifted 2, row r = 3 is not shifted, 
row 

r = 4 is shifted 1, etc. For the odd rows of the BIS Block a corresponding procedure is 
followed. 

In this respect. Figure 17 shows an example of the partial mapping of 
BIS bytes on the first eight sectors, and Figure 18 an example of the partial mapping of BIS 
bytes on the last eight sectors, that after the foregoing are self-explanatory. 

Figure 19 by way of recapitulation gives a schematic representation of 
the overall encoding process. The user data as received from a source that may be a host or an 
application are first divided into data firames that each consist of 2048+4 bytes; as shown in 
block 200 of Oie Figure, 32 of these firames are taken into account for the next encoding step. 
In block 202, a Data Block is formed and arranged into 304 columns of 216 rows each. In 
block 204 a Long Distance Code block is formed through adding 32 rows parity. In block 206 
an ECC cluster is airanged according to 152 columns and 496 rows. TOs is arranged to fill the 
four sections labeled ECC in the Physical Cluster block 218, that is the comprehensive code 
format entity. 

The address and control data added by the Recording System are converted also 
in successive steps. First the Logical Address and Control Data is arranged in 32 * 18 bytes in 
block 208. The logical addresses are those that pertain to user fiinctionalities, and may indicate 
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aspects that relate to a duration of the rendering of a user program. Also the physical addresses 
arc arranged into 16 * 9 bytes in block 210.The physical addresses relate to physical distances 
on the carrier. Due to the repeated renumbering and interieaving, the relation between physical 
and logical addresses has been broken. Items that follow each other closely in a program, may 

5 be spaced from each other by an appreciable physical distance, and vice versa. Also, the 
mapping is not uniformly progressing. In block 212, the addresses are combined in an Access 
Block of 24 columns by 30 rows. In block 214, there are 32 rows of parity added. In block 
216, these arranged into a BIS cluster of 3 columns and 496 rows. These fill the three BIS 
columns in block 218. Also a column of sync bit groups is added, so tfiat a physical cluster of 

10 155 columns by 496 rows is formed. Together these form 16 Physical Sectors that arc grouped 
into 496 Recording Frames as shown. 
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1 . A method for encoding multiword information that is based on multibit 
symbols disposed in relative contiguity with respect to a medium, whilst providing wordwise 
interleaving and wordwise error protection code facilities, for so providing error locative clues 
across multiword groups, 

5 characterized by originating such clues both in high protectivity clue words 

(BIS) that are interleaved among clue columns and also in synchronizing columns constituted 
from synchronizing bit groups and locating said synchronizing columns where said clue 
columns are relatively scarcer disposed, said clues being directed to low protectivity target 
words (LDS) that are interleaved in a substantially uniform manner among target columns 

10 which form uniform-sized column groups between periodic arrangements of clue columns and 
synchronizing colunms. 

2. A method as claimed in Claim I, and arranging said information into uniform- 
sized physical clusters that each have a single synchronizing colunm and an odd number of 

IS clue columns. 

3. A method as claimed in Claim 1, whilst assigning user data exclusively to said 
target columns and assigning system data at least predominantly to said clue columns. 

20 4. A method as claimed in Claim 1, wherein at a lowest level a multi-symbol data 

frame of target symbols contains a multi-symbol but bit-organized error detection bit group 
EDC. 

5. A method as claimed in Claim 4, wherein at a next higher level an ECC sector 

25 contains a plurality of data frames for distribution over a plurality of target words through 
addition of Reed-Solomon redundancy. 



wo 00/07300 PCT/EP99/05338 

15 

6. A method as claimed in Claim 5, while before said interleaving sequentially 
separating various code word blocks as a speeding up measure with respect to later decoding 
(Figure 11). 

7. A method as claimed in Claim 1, whilst superposing said interleaving on row- 
wise incrementally rotating of target symbols within their cluster. 

8. A method as claimed in Claim 1 applied to storage on an optical medium. 

9. A method as claimed in Claim 1, wherein all clue words and target words carry 
a uniform amount of redundancy, but target words have more data symbols than clue words. 

10. A method as claimed in Claim 1 , wherein the clue words of a comprehensive 
Physical Storage Cluster number a multiple (24/3) of the number of clue columns in the 
Physical Cluster; like-numbered symbols of the clue words being distributed into groups of as 
many symbols as there are clue columns in a Physical Cluster over like-ranked Recording 
Frames of different Physical Sectors of the Physical Cluster, in a staggered but otherwise 
uniform interleaving schedule among the various Recording Frames. 

11. A method as claimed in Claim 10, wherein even-numbered clue symbol rows 
arc assigned to a first contiguous group of half of the Physical Sectors, and odd-numbered clue 
symbol rows arc assigned to the second contiguous group of half of the Physical Sectors. 

12. A method as claimed in Claim 10, whercin the rows of clue column symbols 
are subjected to a staggered and systematic rotating among the various clue columns. 

13. A method as claimed in Claim 10, whilst assigning to clue words both logical 
and physical address data pertaining to the actual Physical Cluster. 

14. A method for decoding multiword information that is based on multibit 
symbols disposed in relative contiguity with respect to a medium, having wordwise 
interleaving and word-wise error protection code facilities, and so providing error locative 
clues across multiword groups as a preliminary measure before actually decoding. 
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characterized by deriving such clues both from high protectivity clue words that 
are interleaved among clue columns and also from synchronizing columns constituted from 
synchronizing bit groups through accessing said synchronizing columns where said clue 
columns are relatively scarcer disposed, said clues being directed to low protectivity target 
words that are interleaved in a substantially uniform manner among target colunms which 
form uniform-sized column groups between periodic arrangements of clue columns and 
synchronizing columns. 

15. A method as claimed in Claim 14, and accessing said information according to 
uniform-sized physical clusters that each have a single synchronizing colunm and an odd 
number of clue columns. 

16. A method as claimed in Claim 14, whilst deriving user data exclusively from 
said target columns and system data at least substantially exclusively from said clue columns. 

17. A method as claimed in Claim 14, and accessing at a lowest level within a 
multi-symbol data frame of target symbols a multi-symbol but bit-organized error detection bit 
group, as a preliminary decoding step. 

18. A method as claimed in Claim 17, whilst at a next higher level accessing within 
a data sector a plurality of respective data frames distributed over a plurality of target words 
through evaluating added Reed-Solomon redundancy. 

19. A method as claimed in Claim 14, whilst correcting for on said interieaving 
superposed a row-wise incremental rotating of target symbols within their cluster. 

20. A method as claimed in Claim 14 applied to storage on an optical medium. 

21. A method as claimed in Claim 14, whilst decoding all clue words and target 
words as based on a uniform amount of redundancy therein, whilst allowing in target words 
for more data symbols than in clue words. 



22. A method as claimed in Claim 14, wherein the clue words of a comprehensive 

Physical Storage Cluster number a multiple (24/3) of the number of clue columns in the 
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Physical Cluster, like-numbered symbols of the clue words being derived from groups of as 
many symbols as there arc clue columns in a Physical Cluster over like-ranked Recording 
Frames of different Physical Sectors of the Physical Cluster, in a staggered but otherwise 
uniform deinterleaving schedule among the various Recording Frames. 

23. A method as claimed in Claim 22, wherein even-numbered symbol rows are 
derived from a first contiguous group of half of the Physical Sectors, and odd-numbered 
symbol rows are derived from the second contiguous group of half of the Physical Sectors. 

24. A method as claimed in Claim 22, wherein the rows of clue column symbols 
are derived from staggered and systematically rotated symbols along the various clue columns. 

25. A method as claimed in Claim 22. whilst deriving from clue words both logical 
address data and physical address data pertaining to the actual Physical Cluster. 

26. A device for encoding multiword information that is based on multibit symbols 
disposed in relative contiguity with respect to a medium, through providing wordwise 
interieaving means and wordwise error protection encoding means, for so providing error 
locative clues across multiword groups, 

characterized by said encoding means directing such clues both to high 
protectivity clue words that are interieaved among clue columns and also to synchronizing 
columns constituted from synchronizing bit groups through locating means for locating said 
synchronizing columns where said clue columns are relatively scarcer disposed, all clues being 
directed to low protectivity target words that are interieaved in a substantially uniform manner 
among target columns which form uniform-sized column groups between periodic 
arrangements of clue columns and synchronizing columns. 

27. A device as claimed in Claim 26, and having arranging means for arranging 
said information into uniform-sized physical clusters that each have a single synchronizing 
column and an odd number of clue columns. 

28. A device as claimed in Claim 26, being arranged for asssigning user data 
exclusively to said target columns and system data at least substantially exclusively to said 
clue columns. 
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29. A device as claimed in Claim 26, having generator means for at a lowest level 
in a multi-symbol data ftame of target symbols also generating a multi-symbol but bit- 
organized oTor detection bit group. 

5 

30. A device as claimed in Claim 29, arranged for at a next higher level forming a 
data sector to contain a plurality of data frames for distributed over a plurality of target words 
thru added Reed-Solomon redundancy. 

10 3 1 . A device as claimed in Claim 26, and having superposing means for 

superposing said interleaving on row-wise incrementally rotating of target symbols within 
their cluster. 

32. A device as claimed in Claim 26, and having interfacing means for interfacing 
15 to an optical storage medium. 

33. A device as claimed in Claim 26, wherein said encoding means are arranged for 
assigning to all clue words and target words to carry a uniform amount of redundancy, but to 
target words a lar^r amount of data symbols than to clue words. 

20 

34. A device as claimed in Claim 26, wherein the clue words of a comprehensive 
Physical Storage Cluster number a multiple (24/3) of the number of clue columns in a Physical 
Cluster; said device having having distributing means for distributing like-numbered symbols 
of the clue words in groups of as many symbols as there arc clue columns in a Physical 

25 Cluster, over like-ranked Recording Frames of different Physical Sectors of the Physical 
Ouster, in a staggered but otherwise uniform interleaving schedule among the various 
Recording Frames. 

35. A device as claimed in Claim 26, having assigning means for assigning even- 
30 numbered symbol rows to a first contiguous group of half of the Physical Sectors, and odd- 
numbered symbol rows to the second contiguous group of half of the Physical Sectors. 
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36. A device as claimed in Claim 26, and having rotating means for subjecting the 

rows of clue columin symbols to a staggered and systematic rotating among the various clue 
columns. 

5 37. A device as claimed in Claim 26, and having address assigning means for 

assigning to clue words both logical and physical address means pertaining to the actual 
Physical Cluster. 

38. A device for decoding multiword information that is based on multibit symbols 
10 disposed in relative contiguity with respect to a medium, through effecting wordwise 

deinterleaving and wordwise error code protecting facilities, and so providing error locative 
clues across multiword groups, 

characterized by being arranged for deriving such clues both from high 
protectivity clue words tiiat are interleaved among clue columns and also from synchronizing 
15 columns constituted from synchronizing bit groups, through accessing means for accessing 
said synchronizing columns where said clue columns are relatively scarcer disposed, all clues 
being directed to low protectivity target words that are interleaved in a substantially uniform 
manner among target columns which form uniform-sized colunm groups between periodic 
arrangements of clue columns and synchronizing columns. 

20 

39. A device as claimed in Claim 38, and having accessing means for accessing 
said information according to uniform-sized physical clusters that each have a single 
synchronizing column and an odd number of clue columns. 

25 40. A device as claimed in Claim 38, being arranged for deriving user data 

exclusively from target columns and system data at least substantially exclusively from clue 
columns. 

41. A device as claimed in Claim 38, and having accessing means for accessing at a 
30 lowest level witfiin a data frame a multi-symbol but bit-organized error detection bit group, for 

tiierefrom deriving an error detecting signal. 

42. A device as claimed in Claim 41, wherein said accessing means are arranged 
for at a next higher level accessing within a data sector a plurality of respective data frames 
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distributed over a plurality of target words through evaluating added Reed-Solomon 
redundancy. 

43. A device as claimed in Claim 38, having corrector means for superposing on 

5 said interleaving a row-wise incrementally backrotating of target symbols within their cluster. 

44. A device as claimed in Claim 38, and having interface means for interfacing to 
an optical storage medium. 

10 45. A device as claimed in Claim 38, said decoding means being arranged for 

decoding all clue words all and target words through a uniform amount of redundancy, but for 
deriving from target words a larger amount of data symbols than from clue words. 

46. A device as claimed in Claim 38, wherein the clue words of a comprehensive 
15 Physical Storage Cluster number a multiple (24/3) of the number of clue columns in the 

Physical Cluster; and being arranged for deriving like-numbered symbols of the clue words as 
being distributed into groups of as many symbols as there are clue columns in a Physical 
Cluster from like-ranked Recording Frames of different Physical Sectors of the Physical 
Cluster, in a staggered but otherwise uniform interieaving schedule among the various 
20 Recording Frames. 

47. A device method as claimed in Claim 46, and being arranged for deriving even- 
numbered symbol rows from a first contiguous group of half of the Physical Sectors, and odd- 
numbered symbol rows from the second contiguous group of half of the Physical Sectors. 

25 

48. A device as claimed in Claim 46, being arranged for deriving the rows of clue 
column symbols through staggered and systematic backrotating over the various clue columns. 

49. A device as claimed in Claim 38. being arranged for deriving both logical and 
30 physical address data from the clue columns as pertaining to the actual Physical Cluster. 

50. A unitary storage carrier produced by a method as claimed in Claim 1 , and 
storing multiword information that is based on multibit symbols disposed in relative contiguity 
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thereon, with wordwise interleaving and wordwise error protection code facilities that provide 
enx)r locative clues across multiword groups, 

characterized by originating such clues both in high protectivity clue words that 
are interleaved among clue columns and also in synchronizing columns constituted from 
synchronizing bit groups and locating said synchronizing columns where said clue columns 
arc relatively scarcer disposed, all clues being directed to low protectivity target words that are 
interleaved in a substantially uniform manner among target columns which form uniform- 
sized column groups between periodic arrangements of clue columns and synchronizing 
columns. 

51 . A carrier as claimed in Claim 50, having arranged said information into 
uniform-sized physical clusters that each have a single synchronizing column and an odd 
number of clue columns. 

52. A carrier as claimed in Claim 50, having user data exclusively contained in 
target columns and system data at least substantially exclusively contained in clue columns. 

53. A carrier as claimed in Claim 50, wherein at a lowest level a multi-symbol data 
frame of target symbols also contains a multi-symbol but bit-organized error detection bit 
group. 

54. A carrier as claimed in Claim 53, wherein at a next higher level a data sector 
contains a plurality of data frames distributed over a pluraHty of target words through added 
Reed-Solomon redundancy. 

55. A carrier as claimed in Claim 54, having superposed said interieaving on row- 
wise incrementally rotation of target symbols within their cluster. 

56. A carrier as claimed in Claim 50, and based on an optical storage medium. 



57. A carrier as claimed in Claim 50, wherein all clue words and target words carry 

a uniform amount of redundancy, but target words have more data symbols than clue words. 
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58. A carrier as claimed in Claim 50, wherein the clue words of a comprehensive 
Physical Storage Cluster number a multiple (24/3) of the number of clue columns in the 
Physical Cluster; like-numbered symbols of the clue words being distributed into groups of as 
many symbols as there are clue colunms in a Physical Clusta* over like-ranked Recording 
Frames of different Physical Sectors of the Physical Cluster, in a staggered but otherwise 
uniform interleaving schedule among the various Recording Frames. 

59. A carrier as claimed in Claim 58, wherein even-numbered symbol rows are 
assigned to a first contiguous group of half of the Physical Sectors, and odd-numbered symbol 
rows are assigned to the second contiguous group of half of the Physical Sectors. 

60. A carrier as claimed in Claim 58, wherein the rows of clue column symbols are 
located as subjected to a staggered and systematic rotating among the various clue columns. 

61. A carrier as claimed in Claim 58, wherein clue columns contain both logical 
and physical address data as pertaining to the actual Physical Cluster. 
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